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Abstract 

Let / := (/ , . . . , f n ) be a sparse random polynomial system. This 
means that each f % has fixed support (list of possibly non-zero coeffi- 
cients) and each coefficient has a Gaussian probability distribution of 
arbitrary variance. 

We express the expected number of roots of / inside a region U 
as the integral over U of a certain mixed volume form. When U = 
(C*) n , the classical mixed volume is recovered. 

The main result in this paper is a bound on the probability that 
the condition number of / on the region U is larger than 1/e. This 
bound depends on the integral of the mixed volume form over U, and 
on a certain intrinsic invariant of U as a subset of a toric manifold. 

Polynomials with real coefficients are also considered, and bounds 
for the expected number of real roots and for the condition number 
are given. 

The connection between zeros of sparse random polynomial sys- 
tems, Kahler geometry, and mechanics (momentum maps) is dis- 
cussed. 
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1 Introduction 



A complexity theory of homotopy algorithms for solving dense systems of 
polynomial equations was developed in ||SS93b| , |SS93a| , [5S93c| , |SS96| , |SS94j . 



(see also |PCSS98] , Ch. 8-14]). The number of steps for these homotopy 
algorithms was bounded in terms of a condition number and the Bezout 
number. 

One of the main features of that theory was unitary invariance: the roots 
of a dense system of polynomial equations are points in projective space, and 
all the invariants of the theory are invariant under the action of the unitary 
group. However, unitary action does not preserve sparse coefficient structure. 

In this paper, we give the one-distribution of the roots of random sparse 
polynomial systems. We also bound the probability that the condition num- 
ber of a random sparse polynomial system is large. 

The roots of a sparse polynomial system are known to belong to a certain 
toric variety. However, in order to obtain the theorems below, we needed 
to endow the toric variety with a certain geometrical structure, as explained 
below. The main insight comes from mechanics, and from symplectic and 
Kahler geometry. 



1.1 Expected Number of Roots 

Let A be an M x n matrix, with non-negative integer entries. To the matrix 
A we associate the convex polytope Conv(A) given by the convex hull of all 
the rows, {A a } a&{1: _ jM} , of A: 

{M M ~\ 

Y,t a A a : < t a < 1, = 1 \ c ( R T • 

a=l a=l J 

Here, we use the notation X y to denote the dual of a vector space X. 
There are deep reasons to write Conv(A) as a polytope in dual space, as the 
reader will see later on. 

Assume that dim(Conv(74)) = n. Then we can associate to the matrix A 
the space Ta of polynomials with support contained in {A a : 1 < a < M}. 
This is a linear space, and there are many reasonable choices of an inner 
product in Ta- 

Let C be a diagonal positive definite M x M matrix. Its inverse C~ x is 
also a diagonal positive definite M x M matrix. This inverse matrix defines 
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the inner product: 

The matrix C will be called the variance matrix. This terminology arises 
when we consider random normal polynomials in Ta with variance C aa for 
the a-th coefficient. We will refer to these randomly generated functions as 
random normal polynomials, for short. 

We will also produce several objects associated to the matrix A (and to 
the variance matrix C). The most important one for this paper will be a 
Kahler manifold (T n , Ua, J)- This manifold is a natural "phase space" for 
the roots of polynomial systems with support in A. It is the natural phase 
space for the roots of systems of random normal polynomials in {Ta-, (•, ^c- 1 )- 

More explicitly, let T n = C n (mod 2^v^/^lZ n ) (which, SIS db real mani- 
fold, happens to be an n-fold product of cylinders). Let exp : T n — > (C*) n 
denote coordinatewise exponentiation. Then we will look at the preimages 
of the roots of a polynomial system by exp. We leave out roots that have 
one coordinate equal to zero and roots at infinity. The differential 2-form uoa 
corresponds to the pull-back of the canonical 2-form in a suitable Veronese 
variety (see Section 0). 

Systems where all the polynomials have the same support are called un- 
mixed. The general situation {mixed polynomial systems), where the polyno- 
mials may have different supports, is of greater practical interest. It is also a 
much more challenging situation. We shall consider systems of n polynomi- 
als in n variables, each polynomial in some inner product space of the form 
{Ta -i {■, ^c- 1 ) (where i — 1, • • • , n and each A, and each are as above). 

i 

In this realm, a mathematical object (that we may call a mixed man- 
ifold) seems to arise naturally. A mixed manifold is an {n + 2)-tuple 
{T n , ujau ■ ■ ■ ,UA n ,J) where for each i, {T n ,UAi, J) is a Kahler manifold. 
Mixed manifolds do not have a natural canonical Hermitian structure. They 
have n equally important Hermitian structures. However, they have one 
natural volume element, the mixed volume form, given by 

(_l)n(n-X)/2 

dT n = \ J cj Al A---Au An ■ 

n\ 

As explained in [ |Gro90|| , the volume of T n relative to the mixed volume 
form is (up to a constant) the mixed volume of the n-tuple of polytopes 
(Convex),- •■ ,Conv(A0). 
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We extend the famous result by Bernshtein 
roots of mixed systems of polynomials as follows: 



Ber76 



on the number of 



Theorem 1. Let Ai, ■ ■ ■ ,A n and C\, ■ ■ ■ , C n be as above. For each i = 
1, • • • , n, let fi be an (independently distributed) normal random polynomial 
in (JF4 , (•, -) c -i). Let U be a measurable region of T n . Then, the expected 
number of roots of the polynomial system f(z) =0 in exp U C (C*) n is 




Example 1. When each fa is dense with a variance matrix Cj of the form: 



Ci = Diag 



deg/i 



h\h\-~ ,/»!(deg/i-E?=i^)» 



the volume element dT n becomes the Bezout number \\ deg fi times the pull- 
back to T n of the Fubini-Study metric. We thus recover Shub and Smale's 



stochastic real version of Bezout's Theorem [3S93al. 



The general unmixed case [A\ = ■ ■ ■ = A n , C\ = ■ ■ ■ = C n ) is a particular 
case of Theorem 8.1 in [|EK95|| . This is the only overlap, since neither theorem 
generalizes the other. 

On the other hand, when one sets U = T n , one recovers Bernshtein's 
first theorem. The quantity 7T~ n j T „ dT n is precisely the mixed volume of 
polytopes A\, ■ ■ ■ ,A n (see ||SY93|| for the classical definition of Mixed Volume 



and main properties). 

A version of Theorem [I] was known to Kazarnovskii ||Kaz81| , p. 351] and 
Khovanskii. In [|Kaz81|| , the supports A4 are allowed to have complex expo- 
nents. However, uniform variance (Ci = I) is assumed. His method may 
imply this special case of Theorem [3], but the indications given in ||Kaz81| 
were insufficient for us to reconstruct a proof. 

The idea of working with roots of polynomial systems in logarithmic 
coordinates seems to be extremely classical, yet it gives rise to interesting 
and surprising connections (see the discussions in [ MZa , MZb , VirOOf ). 
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1.2 The Condition Number 



Let T = J r A 1 x • • ■ x J 7 An j an d let / G -T 7 - A root of / will be represented by 
some p + q\/ — 1 G T n . (Properly speaking, the root of / is exp(p + gV — !))• 
In this discussion, we assume that the "root" p+g^/~T is non-degenerate. 
This means that the derivative of the evaluation map 

ev: fx? ^ C" 

(f,p + q\f-L) i-> (/ oexp)(p + <?\/-L) 



with respect to the variable in T n at the point p + qy—l has rank In. We are 
then in the situation of the implicit function theorem, and there is (locally) 
a smooth function G : T — > T n such that for / in a neighborhood of /, we 
have ei>(/, = and G(f) =p + qV^T. 

The condition number of / at (p + q\/—l) is usually defined as 

»(f;p + qV^T) = \\DG 



/II • 

This definition is sensitive to the norm used in the space of linear maps 
between tangent spaces LiTfJ 7 , T^ q yT n ). In general, one would like to use an 
operator norm, related to some natural Hermitian or Riemannian structure 
on T and T n . 

In the previous Section, we already defined an inner product in each 
coordinate subspace Tah given by the variance matrix Cj. Since the evalu- 
ation function is homogeneous in each coordinate, it makes sense to projec- 
tivize each of the coordinate spaces JF4. (with respect to the inner product 
(-, -} c -i). Alternatively, we can use the Fubini-Study metric in each of the 

i 

^Ai's- By doing so, we are endowing T with a Fubini-like metric that is 
scaling- invariant. We will treat T as a multiprojective space, and write 
P(JF) for P^J x ••• XP(JF A J. 

Another useful metric in P(jF) is given by 
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Each of the terms in the sum above corresponds to the square of the sine 
of the Fubini (or angular) distance between f l and g l . Therefore, dp is never 
larger than the Hermitian distance between points in J 7 , but is a correct 
first-order aproximation of the distance when g — > / in P(JF). (Compare 
with ||BOSS98| , Ch. 12]). 
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While T admits a natural Hermitian structure, the solution-space T n 
admits n possibly different Hermitian structures, corresponding to each of 
the Kahler forms u Ai . 

In order to elucidate what the natural definition of a condition number for 
mixed systems of polynomials is, we will interpret the condition number as 
the inverse of the distance to the discriminant locus. Given p + q\J — 1 e T n , 
we set: 

T M = {f e T : ev(f; (p,q)) = 0} 

and we set as the space of degenerate polynomial systems in F^q). 

Since the fiber T( P)q ) inherits the metric structure of J 7 , we can speak of the 
distance to the discriminant locus along a fiber. In this setting, Theorem 3 
in ||BCSS98| , p. 234] becomes: 



Theorem 2 (Condition number theorem). Under the notations above, 
if (p, q) is a non- degenerate root of f , 

max min \\DG f f\\ Ai < * < max max \\DG f f\\ Ai . 

Il/ll<i 1 ar{j,±>(p,q)) ||/||<i 1 

There are two interesting particular cases. First of all, if A\ — ■ ■ ■ — A n 
and C\ = ■ ■ ■ = C n , we obtain an equality: 

Corollary 2.1 (Condition number theorem for unmixed systems). 

Let A\ — • • ■ — A n and C\ — ■ ■ ■ — C n , then under the hypotheses of Theo- 
rem [| ; 

def -1 

/*(/; (P, 0)) = max mm \\DG f f\\ Ai = max max \\DG f f\\ A . = . 

Il/ll<i 1 1 ||/|[<i a P (f } h {m) ) 

We can also obtain a version of Shub and Smale's condition number the- 
orem (Theorem 3 in ||BCSS98| , p. 243]) for dense systems as a particular case, 



once we choose the correct variance matrices: 

Corollary 2.2 (Condition number theorem for dense systems). 

Let d\, ■ ■ ■ ,d n be positive integers, and let Ai be the n-columns matrix having 
all possible rows with non-negative entries adding up to at most di. Let 

r 1 tv ^ dA 
Cj = — JJiaa 



dt ta I (AtfKAW ■ ■ ■ - ELi(A)")! 
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Then, 



def ' ' ' 

(P> ^)) = max min ||DG//||^ = max max ||£>G//||^ = . 

Il/ll<i 1 1 ll/ll<i apU,^(p, g )J 

The factor j- in the definition of the variance matrix Gj corresponds to the 
factor \fdi in the definition of the normalized condition number in [[BCSS98 



p. 233]. It scales the Kahler forms u Ai so that they are equal (see Remark | 
p. [L5| below). 

In the general mixed case, we would like to interpret the two "minmax" 
bounds as condition numbers related to some natural Hermitian or Finslerian 
structures on T n . See Section || for a discussion. 

Theorem is very similar to Theorem D in ||Ded96|| , but the philosophy 



here is radically different. Instead of changing the metric in the fiber ^F(p tq ), 
we consider the inner product in JF as the starting point of our investigation. 
Theorem |2] gives us some insight about reasonable metric structures in T n . 

As in Theorem [I], let U be a measurable set of T n . In view of Theorem 
we define a restricted condition number (with respect to U) by: 

/*(/: n i: 1 



where the distance dp is taken along the fiber Ffaq) ={/:(/ exp)(p + 
'/\~) =0}. 

Although we do not know in general how to bound the expected value of 
fx(f; T n ), we can give a convenient bound for U) whenever U is compact 
and in some cases where U is not compact. 

The group GL(n) acts on T( P)g )T n by sending (p, q) into (Lp, Lq), for any 
L G GL(n). In more intrinsic terms, J and the GL(n)-action commute. With 
this convention, we can define an intrinsic invariant of the mixed structure 
(T n ,w Al ,--- ,uJ An ,J)- 

Definition 1. The mixed dilation of the tuple (w^, • • • , 00 A n ) i s: 

/ / u def . max M=1 (uj Ai )( pq) (Lu, JLu) 

k[u Ai ,--- ,u An ;(p,q)) = mm max . — — — . 

LeGL(n) 1 mm M=1 (u A J M (Lu, JLu) 

Given a set U, we define: 



k v = sup k(u Ai , ■ ■ ■ ,u An ; (p,q)) 
(p,g)eu 
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provided the supremum exists, and kjj = oo otherwise. 

We will bound the expected number of roots with condition number fi > 
e^ 1 on U in terms of the mixed volume form, the mixed dilation ku and the 
expected number of ill-conditioned roots in the linear case. The linear case 
corresponds to polytopes and variances below: 



A 



Lin 



c 



Lin 



Theorem 3 (Expected value of the condition number). Let 

u Lm (n,e) be the probability that a random n-variate linear complex 
polynomial has condition number larger than e^ 1 . Let v A {XJ, e) be the 
probability that fi(f, U) > e^ 1 for a normal random polynomial system f 
with supports Ax, - - - ,A n and variance Cx, • • • ,C n . 
Then, 

lu^Ai Tin, 



u A {U,e)< J f u " ^ v hm (n,y^e) . 
iul\ ^Ay™ 

There are a few situations where we can assert that kjj = 1. For instance, 

Corollary 3.1. Under the hypotheses of Theorem^, if A = Ax = ■ ■ ■ = A n 

and C = Cx = ■ ■ ■ = C n , then 

v A {U,e) < Vol(U) v Un (n,e) . 

The dense case (Theorem 1 p. 237 in ||BCSS98| ) is also a consequence of 
Theorem || 

Remark 1. We interpret z/ Lin (n, e) as the probability that a random linear 
polynomial / is at multiprojective distance less than e from the discriminant 
variety Let g G £(p, g ) be such that the following minimum is attained: 



Ae(c*) n l ~ 
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Without loss of generality, we may scale g such that Ai = 
that case, 



= A n = 0. In 



(p,q)j 



E 



Wf'-g* 
\\f\\ 2 



> 



i\\2 



The right hand term is the projective distance to the discriminant variety 
along the fiber, in the sense of |BCSS98f. Since we are in the linear case, this 



may be interpreted as the inverse of the condition number of / in the sense 
of |BC55gg , Prop. 4 and Remark 2 p. 250]. 

Recall that each f l is an independent random normal linear polynomial of 
degree 1, and that Cj is the identity. Therefore, each f l a is an i.i.d. Gaussian 
variable. If we look at the system / as a random variable in p n ( n + 1 )~ 1 ) then 
we obtain the same probability distribution as in [ BCSS98 ]. Then, using 
Theorem 6 p. 254 ibid, we deduce that 



.Lin , 



V 



n,e) < 



n 3 (n + l)T(n 2 + n) 
T(n 2 + n-2) 



1.3 Real Polynomials 



Shub and Smale showed in | jSS93a|| that the expected number of real roots, in 
the dense case (with unitarily invariant probability measure) is exactly the 
square root of the expected number of roots. 

Unfortunately, this result seems to be very hard to generalize to the un- 
mixed case. Under certain conditions, explicit formulae for the unmixed case 
are available ||Roj96|| . Also, less explicit bounds for the multi-homogeneous 
case were given by [[McLOO . 



Here, we will give a very coarse estimate in terms of the square root of 
the mixed volume: 

Theorem 4. Let U be a measurable set in W n , with total Lebesgue volume 
\{U). Let A\, ■ ■ ■ ,A n and C±, ■ ■ ■ ,C n be as above. Let f be a normal ran- 
dom real polynomial system. Then the average number of real roots of f in 
expU C {Rt) n is bounded above by 



(4vr 2 )-"/ 2 A A(Z7y, 



n\dT n 
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This is of interest when n and U are fixed. In that case, the expected 
number of positive real roots (hence of real roots) grows as the square root 
of the mixed volume. 

It is somewhat easier to investigate real random polynomials in the un- 
mixed case. 

Let Vm.(n, e) be the probability that a linear random real polynomial has 
condition number larger than e^ 1 . 

Theorem 5. Let A = A 1 = ■ ■ ■ = A n and C = C 1 = ■ ■ ■ = C n . Let U C W l 

be measurable. Let f be a normal random real polynomial system. Then, 

Prob [/*(/, Z7) > a' 1 } < E(U) u R (n,e) 
where E(U) is the expected number of real roots on U . 

Notice that E(U) depends on C. Even if we make U = K n , we may still 
obtain a bound depending on C . 
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2 Symplectic Geometry and Polynomial Sys- 
tems 



2.1 About Symplectic Geometry 

Definition 2 (Symplectic structure). Let M be a manifold. A 2-form 
on M is said to be non-degenerate if and only if for all x G M, the only 
vector u G T^M such that for all v G T X M, u> x (u, v) = is the zero vector. 

A symplectic form on M is a closed, non-degenerate 2-form u on M. In 
that case, (M, o>) is said to be a symplectic manifold. 



Definition 3 (Complex structure). Let M be a complex manifold. (We 
assume that M is given with a certain maximal holomorphic atlas). If X : 
U C C n -> M is a chart of M, and p = X(z) E M, then we define J p : 
T P M — > T P M such that the following diagram commutes: 



T P M — T P M 



Mult, by 



T,C n > T z C n 



This is well-defined for each p in M. Indeed, if two charts X and F 
overlap at p, then F o X^ 1 : C n — > C n is holomorphic so its derivative exists 
and commutes with multiplication by \J — 1. 

The map 

J: TM -> TM 
(P,P) *-> (P,J P P) 

is called the canonical complex structure of M. (The complex structure may 
depend on the holomorphic atlas. We assume a canonical holomorphic atlas 
of M is given). Note that — J 2 is the identity on TM. 



Definition 4 (Kahler manifolds). Let M be a complex manifold, with 
complex structure J. Let wbea symplectic form on M (considered as a real 
manifold). Then u is said to be a (1, 1) -form if and only if J*u = uo. A (1, 1) 
form u corresponds to a symmetric form u,vh lv(u, Jv). We say that uj is 
strictly positive if and only if the corresponding symmetric form is positive 
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definite for all p £ M. Therefore, a strictly positive (1, l)-form defines a 
Riemann structure on M. Also, we can recover an Hermitian structure on 
M by setting (u,v) = u>(u, Jv) + V - lou(u, v). 

The triple (M,u>, J) is said to be a Kahler manifold when M, uj and J 
are as above. 

Example 2 (C M ). We identify C M to R 2M and use coordinates Z i = 
X 1 + \f^AY % . The canonical 2-form uz = Ei^i A dYi makes C M into a 
symplectic manifold. 

The natural complex structure J is just the multiplication by The 
triple (C m ,ujz, J) is a Kahler manifold. ■ 

Example 3 (Projective space). Projective space P M_1 admits a canon- 
ical 2-form defined as follows. Let Z = (Z 1 ,--- ,Z M ) e (C M )*, and let 
[Z] = (Z 1 : • • • : Z M ) e P M_1 be the corresponding point in P M -\ The 
tangent space T[2-]P M_1 may be modelled by Z 1 - C TzC M . Then we can 
define a two-form on P M_1 by setting: 

U[z\(u,v) = \\Z\\~ 2 lj z (u,v) , 

where it is assumed that u and v are orthogonal to Z. The latter assumption 
tends to be quite inconvenient, and most people prefer to pull io\z\ back to 
C M by the canonical projection ir : Z i— > \Z\. It is standard to write the 
pull-back t = ir*u![z] as: 

T Z = -^dJ*dUog\\Z\\ 2 , 

using the notation dri = V,- — A cipj + ^ A dqi, and where J* denotes the 
pull-back by J. 

Projective space also inherits the complex structure from C M . Then U[z] 
is a strictly positive (1, l)-form. The corresponding metric is called Fubini- 
Study metric in C M or C^ 7 " 1 . ■ 

Remark 2. Some authors prefer to write \/—ldd instead of —^dJ*d. The 
following notation is assumed: dr] = J2i ^r^dZi and drj = J2i ^AdZi. Then 
they write t z as: 

_ v^T / /Zi dZj A dZ t Ei ZjdZj A ZjdZj 
TZ 2 I ||Z|| 2 \\Z\\ 4 
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Example 4. Let A be an M x n matrix with non-negative integer entries, 
and we associate every row A a of A to the monomial z A " = z l 1 ■ ■ ■ Zn" ■ We 
also assume (as in the introduction) that the corresponding polytope (the 
convex hull of all the rows) is n-dimensional. Also, as in the introduction, let 
C be a diagonal positive-definite matrix (that we called the variance matrix). 
The variance matrix was the matrix of the inner product in Ta- Let C 1 ^ 2 
be the diagonal positive-definite matrix such that C = C x l 2 C l l 2 . The right- 
multiplication of some / G Ta by C~ 1//2 makes the inner product canonical. 

The left-multiplication by C 1 / 2 is the pull-back of this operation in dual-space 
-r-v 

r A . 

We define the map Va from C n into C : 

Because C 1//2 is diagonal, ||Va(;z)|| is invariant by the natural action Z{ i— > 
Zie 9l ^~^ of (S* 1 ) 71 on the variable z. Moreover, we still have the pairing f(z) = 
(f ' -C~ 1 I 2 )-Va{z). The variable (f-C^ 1 ^ 2 ) is corresponds to Mj i.i.d. Gaussian 
variables. 

We can also compose with the projection into projective space, Va = 
7t o V A : C n -> F M ~\ 

V A : z^[V A (z)} . 

When C is the identity, the Zariski closure of the image of Va is called 
the Veronese variety. The map Va is called the Veronese embedding. Notice 
that Va is not defined for certain values of z, like z — 0. Those values are 
called the exceptional set. The exceptional set is contained in the union of 
the planes Zj = 0. 

There is a natural symplectic structure on the closure of the image of 
Va, given by the restriction of the Fubini-Study 2-form. We will see below 
(Lemma [[]) that DVa has rank n for z G (C*) n , because the polytope of A 
has dimension n. Thus, we can pull-back this structure to (C*) n by: 

Also, we can pull back the complex structure of P A/-1 , so that Qa becomes 
a strictly positive (1, l)-form. 

Therefore, the matrix A defines a Kahler manifold ((C*) n , Q A , J)- B 
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Example 5. Let T n = C" (mod 27T\/ — 1 Z n ). We will use coordinates p + 
q^f-\ for T n , where p G W 1 and g G T n = M n (mod 27rZ n ). 

Given (p, g) G T™, we define exp(p, g) = (• • • , e p+9V ^ T , • • • ) G (C*) n . 

For any matrix A as in the previous example, we can pull-back the Kahler 
structure of ((C*) n , f^, J) to obtain another Kahler manifold (T n , to% 
(Actually, it is the same object in logarithmic coordinates, minus points at 
"infinity".) An equivalent definition is to pull back the Kahler structure of 
the Veronese variety by 

a def " _ 

v a = Va° exp . ■ 



Remark 3. The Fubini-Study metric on C M was constructed by applying 
the operator —^dJ*d to a certain convex function (in our case, |log H^H 2 ). 
This is a general standard way to construct Kahler structures. In ||Gro90|| , it 
is explained how to associate a (non-unique) convex function to any convex 
body, thus producing an associated Kahler metric. ■ 



Remark 4. Now a little bit of magic... uja = v*t = v*(~^dJ*d)g, where 
g : Z i— > | log \\Z\\ 2 . Both d and J commute with pull-back, so 



u A = -v*(-dJ*d)g 
= -(^dJ*d)v* A g 
= -(-dJ*d)(gov A ) 



Remark 5. The same is true for ((C*) n , Qa, J)- A particular case should 
be mentioned here. Unitary invariance played an important role in |[SS93I)| . 
[5S93a| , |SS93cj , |SS96| , |SS94|| and in ||BCSS98|1 . Let us now recover that invari- 
ance for dense polynomials. 

Suppose that the rows of our matrix A are the exponent vectors of 
all possible monomials of degree exactly D in n + 1 variables. Let A a = 

[/!,-•-, I n +i). We set C = Diag ( -, <T , D! , . Then 



h\h\ 



'n + l 



g°V A 



-log IIV^H 2 



2 lo S > 



Zn+l 



\2D 



D \lo t 



\Zl, 



Z n +1 1 
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So Qa is a multiple of the Fubini-Study metric, and we can actually extend 

n A t q +1 . 

Let / = fC~ l l 2 E (C M ) V represent the polynomial z ^ f(z) = fVA(z). 
Then evaluation corresponds to the pairing (/, z) t— > f ■ Va(z). The action 
of U(n) on C n+1 induces an action on (C M ) V . All these actions are unitary, 
and the Hermitian structure of the space of polynomials (in the coordinates 
above) is invariant under such actions. ■ 

For the record, we state explicit formula? for several of the invariants 
associated to the Kahler manifold (T", uja, J)- First of all, the function 
g A = g o v A is precisely: 



Formula 2.1.1: The canonical Integral gA (or Kahler potential) of the 
convex set associated to A 

9a{p) = ^ log ((exp(A • p)) T C (exp(A • p))) 



The terminology integral is borrowed from mechanics, and its refers to 
the invariance of gA by T"-action (see appendix [A] for more analogies). Also, 
the gradient of is called the momentum map. Recall that the Veronese 
embedding takes values in projective space. We will use the following 
notation: Va{p) = va(v) I\\va{p)\\- This is independent of the representative 
of equivalence class va{p)- Now, let va(p) 2 mean coordinatewise squaring 
and va(p) 2T be the transpose of va{p) 2 - The gradient of gA is then: 



Formula 2.1.2: The Momentum Map associated to A 

Vg A = v A {pf T A 



Since p h-> v a{p) is a well-defined real function, we may write its derivative 

as 

Dv A (p) = P VA{p) Diag(v A (p))A 

where P v is the projection operator / — j^W. 
Then the second derivative of g\ is 
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Formula 2.1.3: Second derivative of g A 

D 2 g A = 2Dv A (p) T Dv A (p) 



Using the relation — \dJ*dg A = \Yl(.D 2 g A )ijdpi A dqj, one obtains an 
expression for oj A . 



Formula 2.1.4: The symplectic 2-form associated to A: 
N(p,g) = \ S ^{D 2 g A ) ij dp i A 
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We still have to show that uj a is a symplectic form. Clearly, ^d(dJ*dg A ) = 
\d 2 (J*dg A ) = 0. The remaining condition to check is non-degeneracy. In 



view of formulae [2.1.3| and |2.1.4j , this is a consequence of the following fact: 



Lemma 1. Let A be a matrix with non-negative integer entries, such that 
Conv(A) has dimension n. Then (Dv A ) p is infective, for all p G M n . 

Proof. The conclusion of this Lemma can fail only if there are pel™ and 
u 7^ with (Dv A ) p u = 0. This means that 

P VA ( p )di&g(v A )pAu = . 

This can only happen if diag(t>^) p Au is in the space spanned by (v A ) p , 
or, equivalently, Au is in the space spanned by (1, 1, • • • , 1) T . This means 
that all the rows a of A satisfy au = A for some A. Interpreting a row of A as 
a vertex of ConvA, this means that ConvA is contained in the affine plane 
{a : au = A}. ■ 

We can also write down the Hermitian structure of T n as: 



Formula 2.1.5: Hermitian structure of T n associated to A: 
((u,w) A ) M = u H {^D 2 g A ) v w 
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In general, the function v A goes from T n into projective space. Therefore, 
its derivative is a mapping 



(Dv A ) M : T [p , q) T n - T^^^P^- 1 ~ t) A (p + qV-iy C C 

For convenience, we will write this derivative as a mapping into C M , with 
range + <?v / —T)~ L - Let P„ be the projection operator 



P, 



;VV 



H 



Then, 



Formula 2.1.6: Derivative of Va 



(Dv a ) 



(p. 9) 



Pa 



Diag ( rT .. I A 

\v A {P + 9V-1 



An immediate consequence of Formula 2.1.6 is 



Lemma 2. Lei / G JF4 and (p, g) G T n be such that f ■ Va(p + gy— T) = 0. 
Tnen, / • (.Dtu)(p,g) = ^^ {Dv A ) (p, q ) 

In other words, when (/ o exp)(p + q\J — 1) vanishes, -Dt>A and -Dtu are 
the same up to scaling. 

Notice that the Hermitian metric is also 

((u, w) A )(p iq ) = u h Dv A (p, q) H Dv A (p, q)w . 

Finally, the volume element associated to A is 



Formula 2.1.7: Volume element of (T n ,u) A , J) 

dT£ — det ( ^ D 2 g A {p) j dp 1 A ■ • • A dp n A dq 1 A ■ ■ • A dq n 
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2.2 Toric Actions and the Momentum Map 

The momentum map, also called moment map, was introduced in its mod- 
ern formulation by Smale [Sma70] and Souriau |Sou70|| . The reader may 



consult one of the many textbooks in the subject (such as Abraham and 
Marsden |[AM78|| or McDuff and Salamon ||MS98|| ) for a general exposition. 

In appendix we will explicitly show what Lie group action VgA is the 
momentum of, and what the associated Hamiltonian dynamical system is. 

In this Section we instead follow the point of view of Gromov [ |Gro9C . 
The main results in this Section are that 

Proposition 1. The momentum map VgA maps T n onto the interior of 
Conv(A). When VgA is restricted to the real n-plane [q = 0] C T n , this 
mapping is a bijection. ■ 

This seems to be a particular case of the Atiyah-Guillemin-Sternberg 
theorem. However, technical difficulties prevent us from directly applying 
this result here (see appendix [A]) . 

Proposition 2. The momentum map VgA is a volume-preserving map from 
the manifold (T n ,UA, J) into Coiw(A), up to a constant, in the following 
sense: if U is a measurable region of Conv(A) , then 

Vol ((V^r 1 ^)) = n n Volt/ . 
We prove Proposition |2| by first assuming Proposition [1]. 
Proof of Proposition |^. Consider the mapping 

M : T n -> iConv(A) x T n 

Since we assume dimConv(A) = n, we can apply Proposition [l] and 
conclude that M is a diffeomorphism. 

The pull-back of the canonical symplectic structure in IR 2 " by M is pre- 
cisely 10 Ai because of Formulae |2.1.3| and |2.1.4| . Diffeomorphisms with that 
property are called symplectomorphisms. Since the volume form of a sym- 
plectic manifold depends only of the canonical 2-form, symplectomorphisms 
preserve volume. We compose with a scaling by | in the first n variables, 
that divides Volt/ by 2 n , and we are done. ■ 
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Remark 6. Symplectomorphisms are also known to preserve a few other 
invariants such as the symplectic width (see |[MS98|| ). However, symplecto- 
morphisms are not required to preserve the complex structure and therefore 
need not be isometries. 

However, it is explained in UAbrOOfl how to define a new complex structure 
in Coiav(A) x T n that will make the map M a Kahler isomorphism, hence an 
isometry. ■ 

Before proving Proposition [I], we will need the following result about 
convexity. We follow here Convexity Theorem 1.2 in | Gro90 , attributed to 
Legendre: 

Theorem (Legendre). If f is convex and of class C 2 on ~R n , then the clo- 
sure of the image {Vf r '■ r G K n } in (R n ) v is convex. 

Proof. Let Lf be the set of covectors y G (IR n ) v with the property that 
3cGMViG R n f(x) >y-x-c . 

Notice that Lf is a convex subset of (R n ) v . Geometrically, the planes in 
Lf with c minimal correspond to the envelope of the graph of /. 

The set Lf contains {V/ r : r G MJ 1 }: For any given r, we set c r = 
V/ r • r — f(r). Since / is convex, 

f(x) > Vf r -X — Cr . 

For the converse, assume that there is y G Lf not in the closure of {V/ r : 
r G M. n }. Then there is some e > such that 

Vr G R n , \\y-Vfr\\ > e . 
We define the following gradient vector field in W 1 : 

. (y - v/,) T 

X = 

Because the denominator is bounded below by e, this field is well-defined and 
Lispchitz in all of K n . Let us fix an arbitrary initial condition x(0) G M. n , 
and let x(t) denote a maximal solution of the vector field. Since the vector 
field has norm 1, x(t) cannot diverge in finite time and therefore x(t) is 
well-defined for all i el. 

Now we look at the function 1 1— > y ■ x(t) — f(x(t)). Its derivative w.r.t. t 
is (y — V f x (t))x{t) > e. Therefore, lim^oo y ■ x{t) — f(x(t)) = oo. We deduce 
from there that sup r6R n y ■ r — f{r) = oo Hence, y G" Lf, a contradiction. ■ 
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By replacing / by g^, we conclude that the image of the momentum map 
Vqa is convex. 

Proof of Proposition The momentum map VgA maps T n onto the interior 
of ConvA Indeed, let a = A a be a row of A, associated to a vertex of ConvA 
Then there is a direction dgK" such that 



a ■ v = max x ■ v 

a; £ ConvA 

for some unique a. 

We claim that a G VgA(M n )- Indeed, let x(t) = VA(tv), t a real parameter. 
If b is another row of A, 

a-tv „ta'V ~tb'V „b-tv 



as t — > oo. We can then write VAitv) 2T as: 







T 




v A (tv) = 


gta-v 


CDiag 


^ta-v 



Since C is positive definite, C aa > and 

hm vaO) = hm ., ; . ... = e a — — = e a 



t^oo t— >oo 



\v A (tvW a C, 



aa 



where e a is the unit vector in M M corresponding to the row a. It follows that 
lim^oo Vg A (tv) = a 

When we set q = 0, we have det-D 2 gA ^ on M ra , so we have a local 
diffeomorphism at each point p 6 K" Assume that (V^a) p = (V<7aV f° r 
p 7^ p'. Then, let 7(t) = (1 — £)p + tp'. The function i i— > (VgA)y(t) r Y l {t) has 
the same value at and at 1, hence by Rolle's Theorem its derivative must 
vanish at some t* G (0, 1). 

In that case, 

(D'g A Ut^l'(t*),i(n)=0 

and since j'(t*) = p' — p ^ 0, det D 2 gA must vanish in some p G M. n . This 
contradicts Lemma [IJ ■ 
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Figure 1: Geometric interpretation of Dv A 

2.3 More Properties of the Momentum Map 

We can also give an interpretation of the derivative Dv A in terms of the 
momentum map (see figure |l|). 

Lemma 3. 

(Dv a A ) p u=\(v a A ) p \(A a -Vg A (p))-u . 

I (v a ) I 

where stands for jj^j^jp an d where A a and Vg A are co- vectors. 



Proof. By formula 2.1.6 



(Dv A ) p u = di&gv A (p)Au - v A (p)v A (p) T di&g {v A (p)) Au . 
Hence its a-th coordinate is: 

{Dvl) p u = (v a A ) p {A a u - (v A )l T Au) = (v a A ) p (A a - Vg A (p)) 



u 



Bearing in mind that v A (p) 2 = 1, we obtain an immediate consequence: 
Lemma 4. For all (p, q) G T n , 

^Dv A {p, < diam(Convyi) and 

\\ l -D 2 g A (p)\\ < (diam(ConvA)) 2 

22 



2.4 Evaluation Map and Condition Matrix 

In the setting of Theorem p], we can identify each space of polynomials 
(J 7 An (•> ^aJ to the (co)vector space (C Mi ) v , endowed with the canoni- 
cal inner product. The value of p at exp(p + q\/ — 1) is then precisely 

More generally, we can define the evaluation map by 



ev: (F Al x • ■ • x T An ) x T n 
{hi--- Jn,P + qV^l) 



f 1 ■ v Al (p + qyf-T) 
f n -v An (p + q^T) 



Following |PCSS98 |, we look at the linearization of the implicit function 
p + q\f—^ = G(f) for the equation ev(f,p + q\J—l) = 0. 

Definition 5. The condition matrix of ev at (f,p + q\/—l) is 

DG = D T n(ev)- 1 D T (ev) , 
where T = T Ax x • • • x T An . 

Above, D T n(ev) is a linear operator from an n-dimensional complex space 
into C n , while D T (ev) goes from an Mi + • • • + M — n-dimensional complex 
space into C n . 

Lemma 5. Assume that ev(f;p + qy—l) = 0. Then, 

det (DGDG 11 ) 1 dpiAdqiA- ■ -Adp n Adq n = (-i)™^" 1 )/ 2 /\ yf~~~lf-(Dv Ai ) {PA) dpA 

A P • (Dv Ai ) (p _ q) dq . 

Note that although /' • (Dv Ai ) (p q )dp is a complex- valued form, each wedge 
P-{Dv Ai ) [v , q) dpAP-{Dv Ai ) (p,- q )dq is a real-valued 2 form. 



Proof. We compute: 



DJev) 



\(p,g) 



Y%LA(p + qV^)dfl 

Ea=A(P + ^)df^ 
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and hence 



Also, 



D T (ev)D T (ev) H = diag \\v Ai 



'f 1 ■ Dv Al 
f n ■ Dv An 



Therefore, 



det (DG {p , q) DG? p J 



det 



\\VA-lW 1 



J \\VA n \\ An 



We can now use Lemma |2| to conclude the following: 



Formula 2.4.1: Determinant of the Condition Matrix 



H \ — * 



det (DG ip , q) DG? p J 



det 



r ■ Dv An 



We can now write the same formula as a determinant of a block matrix: 



f 1 ■ Dv Al 



det (DG^DG^y 1 = deb 



f n ■ Dv Ar> 



f 1 ■ Dv Al 



r ■ dv a j 



and replace the determinant by a wedge. The factor (— l) n ( n comes 
from replacing dpi A • • • A dp n A dqi A • • • A dq n by dpi A dqi A • • • A dp n A dq n . 
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Proof of Theorem |7|. Given (p, q) G T n , we define J-fa q ) as the space of / G 
•7"Ai x • • • x jF An such that ev(f;p + qy/—i) vanishes. 

Using [ |BCSS98| , Theorem 5 p. 243] (or Proposition | p. f|3] below), we 
deduce that the average number of complex roots is: 

Av S = / / ( II det (DG^DGlJ 1 . 

By Lemma |5j, we can replace the inner integral by a 2n-form valued 
integral: 

f f Hl/'ll 2 / 2 

Av g = (-ir(- 1 )/ 2 / / /\^^f(Dv Ai ) M dpA 

J<i>«)euJfeF M i (2vr) M - 

A f\Dv Ai ) (p ,_ q) dq . 

Since the image of Dv Ai is precisely ^Ailfeg) C one can add 

n extra variables corresponding to the directions v Ai (p + qV — 1) without 
changing the integral: we write T Ai = J- Ai ,(p,g) x Cu^(p + T). Since 
(/* + tv Ai (p + q\/—l)j Dv Ai is equal to f z Dv Ai , the average number of roots 
is indeed: 

Avg = ( I f\ • (Dv^dp A 

A f ■ {Dv Ai ) {p ,_ q) dq . 

In the integral above, all the terms that are multiple of f^fl for some 
a ^ (3 will cancel out. Therefore, 



A (Dv Ai )l- q) dq • 



Now, we apply the integral formula: 



|2 e / I |2 e 



rreC 



xi — — -i— = / bi = 2 

1 11 (2tt) m J XieC l 11 27T 
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to obtain: 



Avg 



/_]Wn-l)/2 



7T 



(P>9)et/ 



According to formulae |2.1.3| and |2.1.4j , the integrand is just 2 n f\ u Ai , and 



thus 



(l\n(n-l)/2 r 



7T 



ri! 



7T' 



3 The Condition Number 
3.1 Proof of Theorem |2] 

Let (p, q) G T n and let / G F( P ,q)- Without loss of generality, we can assume 
that / is scaled so that for all i, \\f l \\ = 1. 

Let 5f G Fop^) be such that / + 5f is singular at (p, g), and assume that 

|| 5f l || 2 is minimal. Then, due to the scaling we chose, 



H\\2 



Since / + Sf is singular, there is a vector u^O such that 
\f l + 5n-{Dv Al ) M 



and hence 



(r + 5r)-(z^j (M) 

(/ 1 + 5/ 1 )-( J Dt; Al ) (M) 
(fn + 5r)-(Dv An ) M 



u = 



it = . 



This means that 



Z 1 ■ Dv Al u 



-df 1 ■ Dv Al u 



f n -Dv An u = -5f n -Dv An u 
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Let D(f) denote the matrix 



D(f) 



def 



f 1 ■ (Dv Al ) M 

f n -(Dv An ) M 



Given v = D(f) u, we obtain: 

Vl = Sf 1 ■ Dv Al D(fy l v 

v n = -Sf n -Dv An D(f)~ 1 v 
We can then scale u and v, such that ||t> II = 1. 



(3.1.1) 



Claim. Under the assumptions above, 5f l is colinear to (Dv Ai D(f) v 

Proof. Assume that 5f l = g + h, with g colinear and h orthogonal to 
{Dv At D{f)~^) ■ As the image of Dv Ai is orthogonal to v Ai , g is orthogonal 
to v A ., so ev(g\ (p,q)) = and hence ev(h l ,(p,q)) = 0. We can therefore 
replace 5f l by g without compromising equality (|3.1.1|) . Since \\Sf\\ was 
minimal, this implies h = 0. ■ 

We obtain now an explicit expression for 5f l in terms of v. 



5f = -v, 



\\Dv Ai D{fYM\ 2 



Therefore, 



¥f l 



Vi 



WDv^DU^A \\{D{f)-H)\\ Ai 
So we have proved the following result: 
Lemma 6. Fix v so that ||t> || = 1 and let Sf G J 7 ^^) be such that equation 



( \3.1.]\ ) holds and \\5f\\ is minimal. Then, 

\vA 



\\sr 
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Lemma |G| provides an immediate lower bound for \\8f\\ = a/S ll^/*ll 2; 
Since 

¥f\\> 



max, \\D(f) x f IU, 
we can use IM| = 1 to deduce that 



/^T|| W > 1 , „ > 



maxj \\D(f) 1 v\\a 3 max,- \\D(f) 1 \\ Aj 

Also, for any v with \\v\\ = 1, we can choose 5f minimal so that equa- 
tion (13.1.1|) applies. Using Lemma || we obtain: 



\\sf\\ < 



min^- \\D(f) 1 v\\ Aj 
Hence 



'£11*/ 



,; || 2 < 



minj \\D(f) 1 v\\ Aj 
Since this is true for any v, and \\Sf\\ is minimal for all v, we have 



max||„|| = i miiij \\D(f) 1 \\ Aj 
and this proves Theorem 0. 

3.2 Idea of the Proof of Theorem § 

The proof of Theorem ^| is long. We first sketch the idea of the proof. Recall 
that Ffaq) is the set of all / £ T such that ev(f;p + g^/~T) = 0, and that 
is the restriction of the discriminant to the fiber J~{p, q )'- 

= {/ £ F{p,q) '■ D(f)(p tq ) does not have full rank} . 
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The space T is endowed with a Gaussian probability measure, with vol- 
ume element 

p -||/H 2 /2 

-dT , 



(2tt)£ m * 

where dT is the usual volume form in T = (J- An OaJ x • • • x (T An , (•, -)a„) 
and || /|| 2 = ^ 1 1/ 1 1 1 a - For ^ a set in T n , we defined earlier (in the statement 
of Theorem |]) the quantity: 

u A (U, e) d ^ Prob[/*(/, 17) > e" 1 ] = Prob[3(p, q) EU : d P {f, S (p>?) ) < e] . 

The naive idea for bounding z/ A ([/, e) is as follows: Let V{e) == 
{(/>(?>?)) £ TxU : ev(f;(p,q)) = and dp(/,E(p,,)) < e}. We also 
define 7r : V(e) — > .F as the canonical projection mapping T x U to J 7 , and 

set #v (e) (/) = #{(p,g) e U : (/, (p,g)) G V»}. Then, 



IP/2 



l|2/ 2 



< 



/■ e -n/ir/2 



with equality in the linear case. 



Now we apply the coarea formula |[BCSS98l Theorem 5 p. 243] to obtain: 



vA(U ' £) " ' / >%^> nam S^ dr m " 



where dVr« stands for Lebesgue measure in T n . Again, in the linear case, 
we have equality. 

We already know from Lemma [5] that 

n 

l/NJ{- (p, q)) = [\P- (Dv Ai ) M dp A f ■ (Dv Ai ) M dq . 
i=i 

We should focus now on the inner integral. In each coordinate space T Ai , 
we can introduce a new orthonormal system of coordinates (depending on 
(p,q)) by decomposing: 
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where /* is the component colinear to v A ., f* is the projection of /' to 
(range Dv Ai ) , and is orthogonal to /' and /*. 
Of course, f l G (^ij^q) if and only if fl = 0. 
Also, 

n 

A /* • ( Dv Aj( P , q )dp A f ■ (Dv A J M dq = 
i=i 

n 

= f\fl- ( Dv Aj( P , q )dp A /* • (Dv A J M dq . 
i=i 

It is an elementary fact that 
It follows that for / G J~( v , q )'- 
with equality in the linear case. Hence, we obtain: 



v A (U,e)< [ [ [\fi-{Dv A X, q) dp^fi-{Dv A X, q )dq 



vi=l 

g~ l/i+/nll 2 /2 



(2tt)£ m * 



dT dVr" , 



with equality in the linear case. We can integrate the ^(Mj — n — 1) variables 
/ M to obtain: 

Proposition 3. 

v A (U,e)< [ [ 2 (/\fi-{Dv Ai ) M d P Afi-(Dv Ai ) M dq 



'(p,g)e(7cT ; - 

*un,i(p,,))<E 



i=l 

> I 2 



( 27r )n(n+l) 

w?£/i equality in the linear case. ■ 



e -||/SH 2 /2 
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3.3 Prom Gaussians to Multiprojective Spaces 

The domain of integration in Proposition ^ makes integration extremely dif- 
ficult. In order to estimate the inner integral, we will need to perform a 
change of coordinates. 

Unfortunately, the Gaussian in Proposition |3| makes that change of coor- 
dinates extremely hard, and we will have to restate Proposition |3| in terms 
of integrals over a product of projective spaces. 

The domain of integration will be P n_1 x • ■ ■ x P™ . Translating an inte- 
gral in terms of Gaussians to an integral in terms of projective spaces is not 
immediate, and we will use the following elementary fact about Gaussians: 

Lemma 7. Let tp : C n — > R be C* -invariant (in the sense of the usual scaling 
action). Then we can also interpret if as a function from P n_1 into WL, and: 

If f e -\\A\ 2 lt 

— , . -7T / ip(x)d\x] = / ip(x)— — r — dx , 

where, respectively, the natural volume forms on P™" 1 and C n are understood 
for each integral. 

Now the integrand in Proposition ||| is not C* -invariant. This is why we 
will need the following formula: 

Lemma 8. Under the hypotheses of Lemma [J 

If If e -INI 2 /2 

ip(x)d[x] = — / ||a:|| 2 (/?(a;) — dx . 



Vol(P«+ 1 )i HeP „. 1 ^ v ' L 1 2nJ xeCn n 11 rv ' (2tt 

where, respectively, the natural volume forms on P n_1 and C n are understood 
for each integral. 



Proof. 

||s1|V0e) Y V dx = I I k| 2n+ V(Q) ^ v drdG 



-\\x\\ 2 /2 f roo p-|r| 2 /2 



( 27r ) n 7ees2«-i J r =o (27r) 



f ( e ~\r\ 2 /2 poo e -\r\ 2 /2 \ 

iees 2 «-i \ (270™ n Jr=o (2vr) n / 



o 
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e -IMI 2 /2 
2n I ipix)—. — : — dx 



We can now introduce the notation: 



n 

WEDGE A (/ E ) ^ f\ j—fl ■ (Dv Ai ) {m) dp A fi ■ {Dv Ai ) M dq . 
i=i 

This function is invariant under the (C*) n -action A * / E : / E i— > 
(Ai/j, • ■ • , A n /™). 

We adopt the following conventions: JF E c is the space spanned by 
coordinates / E and P(J r n ) is its quotient by (C*) n . 
We apply n times Lemma [8] and obtain: 

Proposition 4. Let VOL = f Vol(P n_1 ) n . Then, 



^p(/l,S( Pj(J ))<£ 



an<i in the linear case, 



J(p,q)€UCT ^ dp(9EiS Lin ))<£ 

Now we introduce the following change of coordinates. Let L G GL[n) 
be such that the minimum in Definition [I] p. § is attained: 

yj : P™- 1 x ■ ■ ■ x P"- 1 -> P 71 " 1 x • ■ ■ x P"" 1 

/h i-> ft = f <^(/i) , such 

that g\ = fi- Dv Ai L . 

Without loss of generality, we scale L such that det L — 1. The following 
property follows from the definition of WEDGE: 



WEDGE A (/ E ) = WEDGE Lin (g E ) \J (3.3.1) 
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Assume now that rfp(/ n , £(p, 3 )) < e. Then there is 5f E JF n , such that 
f + 5f E ^l n a ) and < e (assuming the scaling ||/*|| = 1 for all i). 



Setting g z = tp(f M ) and 5g = <p(g), we obtain that g + 8g E £ 



Lin 
0.9) ' 



ll^ll 2 



At each value of %, 



WW < ¥t\\ K(n ni . 



where k denotes Wilkinson's condition number of the linear operator Df^tp 1 . 
This is precisely k(Dva,L). Thus, 

d P (5f,£(^)) < emaxK(Dt) A ,L) = max a/^cuaJ 
Thus, an ^-neighborhood of ^ is mapped into a ^JktjE neighborhood 



of S Lin 



(.p,i) ' 

We use this property and equation ( |3.3.1| ) to bound: 



u A (U,e)< 



VOL 



A ' TT " - / w/r Lr-x-xr- WEDGED) 



,i 112 



• II ]feLl J ^ _1 | 2 ^P"" 1 x ■ ■ ■ x pn ^) ^ (3.3.2) 
i=l ll^ll 

where J 9E <^ _1 is the Jacobian of tp -1 at g n . 
Remark 7. Considering each Dt> ^. as a map from C n into C n , the Jacobian 



is: 



i=i 



We will not use this value in the sequel. 
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In order to simplify the expressions for the bound on is A (U,e), it is con- 
venient to introduce the following notations: 



n 1 1 1 1 2 

tj 4£. f TT llydl | t -li2 



, , Ml 

-{ 9 :d r (9,T.^ q) )<s} 

Now equation (|3.3.2[) becomes: 



def 

Xs = X) 



v A (U,e) <n\ f (^ Lin )A« f dP H(g % ) XVmM 

(3.3.3) 



Lemma 9. Let (p, q) be fixed. Then P™ 1 x ■ ■ • x P™ 1 together with density 
function dP, is a probability space. 

Proof. The expected number of roots in U for a linear system is 
n\ I a& n / dP . 

J(p,q)£U J gieP n - 1 x---xP n - 1 

It is also n\ J u oo^ i 2- This holds for all U, hence the volume forms are the 
same and 

dP = 1 . 

9neP n ~ 1 x---xP n - 1 



This allows us to interpret the inner integral of equation ( ft.3.3|) as the 
expected value of a product. This is less than the product of the expected 
values, and: 
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v A (U,e) <n\ f Kin) An ( [ dP H(g,)) ■ 

\J g I eP n - 1 x---xP™- 1 J 

Because generic systems of linear equations have one root, we can also 
consider U as a probability space, with probability measure L in nlu;^" ■ 
Therefore, we can bound: 



'(p,q)eU J g- i & n - 1 x---xP n - 1 



Vo\ Un U 



(f n\(uj Un ^ n [ d p Xv - E ( gn )\ 



The first parenthesis is Vol A (U). The second parenthesis is z/ Lin ( v //t[/£, U). 
This concludes the proof of Theorem [3[ 



Proof of Corollary \3. 1\ . We set L = Dv A \rangeDvAi, then 
k(u Ai ,--- ,u An ;(p,q)) = 1. ■ 



4 Real Polynomials 

4.1 Proof of Theorem f| 

Proof of Theorem f|. As in the complex case (Theorem |I|), the expected num- 
ber of roots can be computed by applying the co-area formula: 

AVG= / / TT- j^det{DG DG")- 1 ' 2 . 

Now there are three big diferences. The set U is in M n instead of T n , 
the space J 7 ^ contains only real polynomials (and therefore has half the 
dimension), and we are integrating the square root of 1/ det(_DG DG H ). 

Since we do not know in general how to integrate such a square root, we 
bound the inner integral as follows. We consider the real Hilbert space of 
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functions integrable in JF^ endowed with Gaussian probability measure. The 
inner product in this space is: 



*t r -A- e -n/ii 2 /2 

7=1 

where dV is Lebesgue volume. If 1 denotes the constant function equal 
to 1, we interpret 

AVG — [ (27t)-™/ 2 (detODG DG H y 1/2 , l) . 
Hence Cauchy-Schwartz inequality implies: 

AVG< [ (2ir)- n / 2 \\det(DG DG H )- 1/2 \\\\1\\ . 
By construction, ||1|| = 1, and we are left with: 



r c n p-II/ i II 2 /2 

AVG < / (2vr)-/ 2 / n_^_^det(DG£>G*)-i . 
^P6C/ \ J J* t=i V2tt 

As in the complex case, we add extra n variables: 



AVG<{2^ 2 / JjL^det(DGDG^)-i 

and we interpret det(.DG DG H )~ l in terms of a wedge. Since 

e -|NI 2 /2 /• e -J/ 2 /2 r e -y 2 /2 

N 2 /— — m = / y 2 —/tr = ~t^ = 1 ' 

l M V27T ^eM V^TT JygR V^TT 

we obtain: 

AVG < (2vr)- n/2 / v^WT^ = (27r)" n/2 f V^MT 1 . 

JpeU Jp& 
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Now we would like to use Cauchy-Schwartz again. This time, the inner 
product is defined as: 

(<P,iJ>) = / <p(p)il)(p)dV . 
Jpeu 

Hence, 

AVG < (2n)- n/2 (n\dT n ,l) < (27i)- n/2 \\n\dT n \\ . 
This time, ||1|| 2 = X(U), so we bound: 



AVG < (2ny n/2 y^{U)J J n\dT n 



< (4tt 2 )-"/ 2 v^(Z7) J / n\dT n 

'(p,q)eT n ,p£U 



4.2 Proof of Theorem |5] 

Proof of Theorem [|. Let e > 0. As in the mixed case, we define: 

MU,e) = Prob /6 ^ [ f i{f;U)>e- 1 ] 

= Prob /eF [Bp e U : ev(f;p) = and d r (f, E p ) < e] 

where now U G M n . 

Let V(e) d = {(f,p) E F R x U : ev(f;p) = and d F (f,E p ) < e}. We also 
define 7r : V{e) — > P(JF) to be the canonical projection mapping F^xU to 

F R and set #v (e )(/) = e V(e)}. Then, 

r e-EilirilVs 
i*(E/,e) = / ^ rM . X*(v( e ))(/) ^ 

f f e -S^I /l " 2 / 2 1 R 

dp(/,S P )<£ v 
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/ —^£M-#V{e)dF 



As before, we change coordinates in each fiber of J-\ by 

f = fi + U + fm 

with fl colinear to v^, (f^) T in the range of Dva, and /* othogonal to /* and 
/j. This coordinate system is dependent on p + qyf— T. 

In the new coordinate system, formula |2.4.1| splits as follows: 



det (DG {p) DGf p) ) 1/2 dV T n 



det 



det 



,(/n n )i 



(/n )«■ 

(/n )n 
(/n )n 

{fl )n 



det 



(Dv A Ti 
det Dv^Dva 



dV 



The integral E(U) of a/ det DvaDv 1 ^ is the expected number of real roots 
on U, therefore 



l (U,e)<E(U) 



■Zi\\fi+fi\\ 2 /2 



rfp(/n+/n,S p )<e 



det 



(/■") 



(/n )n 



In the new system of coordinates, £ p is defined by the equation: 

(/n)l • • • (/l)n 



det 



. 



Since ||/ n + / m || > ||/ n ||, 

dp(/i + /m, S p ) < £ d P (/ n , Sp) < e 
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This implies: 

MU,e)<E(U) 



e -£jl/i+/yi 2 /2 



dp(/n,[det=0])<e 



det 



(A 1 )! ••• (Z, 1 )* 



(fv)l • • • (/][% 

We can integrate the Mj — n — 1) variables / M to obtain: 
MU, e) = E(U) / 2 -5— det / E | 2 cflR" 

d P (/ n ,[det=0])< £ VZ7r 

This is E(U) times the probability u(n,e) for the linear case. 



5 Mixed Thoughts about Mixed Manifolds 

Let X : E — > F be a linear operator. Here, we assume that has a canon- 
ical Riemannian structure, and that F has n possibly different Riemannian 
structures (-,-) Ai - 

We would like to interpret the quantities 



and 



max min HX-y^ 
IMI<i j 



max max HXwH^. 

\\v\\<l j 



in terms of an operator norm on X. Let B denote the unit ball in E, and let 
Bi denote the unit ball in (F, (•, -) A .). Note that 



maxmaxllXulU. < f X(B) C Pl^ 

IMI<i j 11 " Al " ' ' 



while 
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maxmin||Xt;|| Ai < 1 X(B) C I J 5; 

\\v\\<l j ^ 



This should be compared to: 



min max HXu^ < 1 

i \\ v \\< 1 



3i : X(B) C Bi . 



One standard way to define norms is by choosing an arbitrary symmetric 
convex set, and equating that set to the unit ball. (Such norms are called 
Minkowski norms.) 

There are two immediate obvious choices: 

1. We can use f] Bi as the unit ball. 

2. We can use Conv |J Bi as the unit ball. 

In the first case, we can endow T n with a C° Finsler structure, while in 
the second case we can obtain a C l Finsler structure. 

Using Conv [J Bi would have the advantage of a known probabilistic 
bound for \i > e~ l . However, f]Bi seems to be more convenient for the 
study of polyhedral homotopy ||HS95| . 

Finsler structures are legitimate ways to endow a non-Riemannian mani- 
fold with a few familiar concepts. For instance, once we define a Finsler struc- 
ture HI - III 9 the length of a curve x(t), t E [0, 1] is defined to be \lx{t)\l x M dt. 



A general discussion on Finsler geometry can be found in ||CCL99| , Ch. 8]. 



Remark 8. The proof of Theorem |2| strongly suggests that the geometry of 
mixed manifolds should be determined by a much more fundamental invari- 
ant, a norm in the space L(C n , T( p ^T n ), which we can take to be either side 
of the following equality: 



max 

V 



s 



E 



,i\2 



\Xv\\% 



max 

u 



E 



l u \i\2 



\U\ 
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Remark 9. There is a class of polynomial systems that are not unmixed, 
but nevertheless can be treated as if they were unmixed. For instance, in 
the dense case, the potentials g^, i = l,-,n are all multiples of one an- 
other, therefore k = 1. The toric variety associated to those systems admits 
therefore a (possibly singular) Hermitian structure. That structure is non- 
singular provided that the A^s satisfy Delzant's condition [ Del88|| (see also 



Appendix [A| below). Roughly speaking, Delzant's condition is an assertion 
about the angle cones of the Minkowski sum of the Conv(Aj). ■ 

A Mechanical Interpretation of the Momen- 
tum Map 

The objective of this Section is to clarify the analogy between the geometry 
of polynomial roots and Hamiltonian mechanics. The key for that analogy 
was the existance of a momentum map associated to convex sets. 

In the case the convex set is the support of a polynomial, that momentum 
map is also the momentum map associated to a certain Lie group action, 
namely the natural action of the n-torus on the toric manifold T n : 

The n-torus T n = W 1 (mod 2vrZ n ) acts on T n by 

p ■ (p, q) ^(p,q + p) , 

where pGT". 

This action preserves the symplectic structure, since it fixes the p- 
variables and translates the q- variables (see Formula |2.1.1|) . Also, the Lie 



algebra of T n is W 1 . An element £ of W. n induces an infinitesimal action (i.e. 
a vector field) X% in T n . 

This vector field is the derivation that to any smooth function / asso- 
ciates: 

If we write df = d p fdp + d q fdq, then this formula translates to: 
(X c ) M (f) = -f(D 2 g A ) p d p f 



by Formula 2.1.4 
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This vector field is Hamiltonian: if (p(t), q(t)) is a solution of the equation 

(P(t),q(t)) = (X^ p(thq{t) 



then we can write 



where = V<7a(p) • £. 

This construction associates to every £ e M n , the Hamiltonian function 
H(. = ^9a(p) ■ £• The term V<7a(p) is a function of p, with values in (M n ) v 
(the dual of R n ). In more general Lie group actions, the momentum map 
takes values in the dual of the Lie algebra, so that the pairing V(?a(p) • £ 
always makes sense. A Lie group action with such an expression for the 
Hamiltonian is called Hamiltonian or Strongly Hamiltonian. 

The theorem of Atiyah, Guillemin and Sternberg asserts that under cer- 
tain conditions, the image of the momentum map is the convex hull of the 
points A a . One of those conditions requires that the action should take place 



in a compact symplectic manifold (as in ||Ati82| , Th. 1] or [|GS82| , Th. 4]) or 
(sometimes) a compact Kahler manifold ||Ati82j , Th 2]. 

We may consider a compactification of T n , such as the closure of exp(T n ). 
Unfortunately, there are situations where this compactification is not a sym- 
plectic manifold, because the form uj^ vanishes in the preimage of one or 
more A a, s (thus differing from the assumptions of [|Ati83|, p. 127]). 



In |pe!88||, a necessary and sufficient condition for the compactifica- 



tion of T n to be a symplectic manifold is given. Namely, the polytope 
Conv(A) should be simple (i.e., every vertex should be incident to exactly n 
edges) and unimodular (the integer points along the rays generated by each 
such n-tuple of edges should span Z n as a Z-module). If all the polytopes 
Conv(Ai), . . . , Conv(A n ) satisfy Delzant's condition then we can construct a 
corresponding compactifaction T n and apply Atiyah-Guillemin-Sternberg's 
theorem. 

Another possibility is to blow-up the singularities, as explained in ||MS98 



If we do so, polytopes Ai, • • • ,A n will be "shaved:" locally, the cone emanat- 
ing from a vertex will be truncated by the intersection with a half-space with 
boundary infinitesimally close to a supporting hyperplane. From another 
point of view, the underlying normal fan [[Ewa96fl will be refined. However, 
the relation between the original polynomial system and the new momentum 
map is not yet clear. 
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B The Coarea Formula 

This is an attempt to give a short proof of the coarea formula, in a version 
suitable to the setting of this paper. This means we take all manifolds and 
functions smooth and avoid measure theory as much as possible. 

Proposition 5. 1. Let X be a smooth Riemann manifold, of dimension 
M and volume form \dX\. 

2. Let Y be a smooth Riemann manifold, of dimension n and volume form 
\dY\. 

3. Let U be an open set of X , and F : U —>■ Y be a smooth map, such that 
DF X is surjective for all x in U. 

4- Let if : X — > R + be a smooth function with compact support contained 
in U . 

Then for almost all z G F(U), V z = F~ x (z) is a smooth Riemann mani- 
fold, and 

[ ip{x)NJ{F;x)\dX\ = [ [ ip{x)\dV z \\dY\ 

where \dV z \ is the volume element ofV z and NJ(F,x) = a/ det DF^DF X is 
the product of the singular values of DF X . ■ 

By the implicit function theorem, whenever V z is non-empty, it is a 
smooth (iV— n) -dimensional Riemann submanifold of X. By the same reason, 

V = {{z , x) : x G V z } is also a smooth manifold. 
Let 7] be the following iV-form restricted to V: 

r] = dY A dV z . 

This is not the volume form of V. The proof of Proposition ^ is divided 
into two steps: 

Lemma 10. 

I <p(x)\n\= [ <p(x)NJ(F',x)\dX\ 
Jv Jx 
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Lemma 11. 



<p(x)\ri\= / / ip{x)\dV z \\dY\ . 



Proof of Lemma [77]. We parametrize: 



Then, 



i>: X -> V 

x \— > (F(x),x) 



tp(x)\ri\ = / ((potp)(x)\ip*T]\ . 

V J X 



We can choose an orthonormal basis Ui, ■ ■ ■ ,Um of T X X such that 
u n +i, ker DF X . Then, 



Dtp(ui 

Thus, 



(DF x Ui,Ui) i = 1, • • • ,n 
(0,Ui) i = n + l,--- ,M 



\tfj*r](u u - ■ ■ ,u M )\ = \r](Di(mi, ■ ■ ■ , Dipu M )\ 

= \dY(DF x ui, ■ ■ ■ , DF x u n )\ \dV z (u n+1 , • • • , u M ) 
= | det DF x \ kevDF ±\ 



and hence 



NJ(F, x) 



ip(x)\r]\ = / (p(x)NJ(F;x)\dX\ 
v Jx 



Proof of Lemma [77j. We will prove this Lemma locally, and this implies the 
full Lemma through a standard argument (partitions of unity in a compact 
neighborhood of the support of <p). 

Let xo, zq be fixed. A small enough neighborhood of (xq, Zq) C V Zq admits 
a fibration over V Zo by planes orthogonal to ker DF Xo . 
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We parametrize: 



e, Y xv zo - v 



[z,x) i-> (z,p(x,z)) 



V 



where p(x, z) is the solution of F(p) = z in the fiber passing through 
(z ,x). Remark that 9*dY = dY, and 9*dV z = p*DV z . Therefore, 

6*(dY A dV z ) = dY A (p*dV z ) . 

Also, if one fixes z, then p is a parametrization V^ — > V 2 . We have: 

<^(x)|?7| = / y?(p(x,z))|r^| 



The proposition below is essentially Theorem 3 p. 240 of [pCSS98|| . How- 
ever, we do not require our manifolds to be compact. We assume all maps 
and manifolds are smooth, so that we can apply proposition [|. 

Proposition 6. 

1. Let X be a smooth M-dimensional manifold with volume element \dX\. 

2. Let Y be a smooth n-dimensional manifold with volume element \dY\. 

3. Let V be a smooth M-dimensional submanifold of X x Y , and let tti : 
V — > X and 7T2 : V — > Y be the canonical projections from X x Y to 
its factors. 

4- Let £' be the set of critical points of we assume that £' has measure 
zero and that £' is a manifold. 

5. We assume that 7t2 is regular (all points in ^(V) are regular values). 
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6. For any open set U C V , for any x G X, we write: == 
^ z {n^ 1 (x) fl U}. We assume that § x #y(x)\dX\ is finite. 

Then, for any open set U C V , 

[ #u(x)\dX\= I [ 1 =\dVA\dY\ 

Jx^m Jzer Vdet DG X DG» 

where G is the implicit function for (x,G(x)) G V in a neighborhood of 
(x,z) eV\E'.M 

Proof. Every (x, z) G U \ £' admits an open neighborhood such that 7i"i 
restricted to that neighborhood is a diffeomorphism. This defines an open 
covering of U \ Since U \ T! is locally compact, we can take a countable 
subcovering and define a partition of unity (<^a) AeA subordinated to that 
sub covering. 

Also, if we fix a value of z, then (v5a)aga becomes a partition of unity for 
^(Vr 1 ^) HU). Therefore, 



#u(x)\dX\ = X) / <px(x,z)\dX\ 



y I I <Px(x,z) 

' ey Jx,zeSv.ppip x (G, x) 



xeA 



1 \dX\ 



>z& Jx£V z NJ(G,x) 

where the second equality uses Proposition [| with = (p\/NJ. Since 
NJ = y/det DG X DG", we are done. ■ 
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